SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Tsien et al . , Roger Y. et al. 

(ii) TITLE OF INVENTION: LONG WAVELENGTH MUTANT FLUORESCENT 
PROTEINS 

(iii> NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 



(v) COMPUTER READABLE FORM: 
p> (A) MEDIUM TYPE: Floppy disk 

ffft (B) COMPUTER: IBM PC compatible 

V (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

p (D) SOFTWARE: Patentln Release #1.0 , Version #1.30 

v.'§ 

(vi) CURRENT APPLICATION DATA: 
M< (A) APPLICATION NUMBER: UNKNOWN 

(B) FILING DATE: 1S-AUG-1997 

(C) CLASSIFICATION: 

~^(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haile, Lisa A. 

(B) REGISTRATION NUMBER: 38,347 

~ a% (C) REFERENCE / DOCKET NUMBER: 07257/ 056001 

«S (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 
til (B) TELEFAX: 619/678-5099 



f2% INFORMATION FOR SEQ ID NO : 1 : 

■cajr 

t'U (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..714 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG AGT AAA GGA GAA GAA CTT TTC ACT GCA GTT GTC CCA ATT CTT GTT 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Ala Val Val Pro lie Leu Val 
15 10 IS 

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Pbe Ser Val Ser Gly Glu 
20 * 25 30 

GGT GAA GGT GAT GTA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC 144 
Gly Glu Gly Asp Val Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
sj 35 40 45 

ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
SO 55 60 

TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CGG 240 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
«5 70 75 80 



« • 



CAT GAC TTT TTC AAG ACT GCC ATG CCC GAA GGT TAT GTA CAG CAA AGA 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Gin Arg 
8S 90 95 

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC 336 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 10S 110 

AAG TTT GAA CGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 3 84 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
US 120 125 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAT AAA TTG GAA TAC AAC 432 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA 480 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 528 
,Ile Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
j«s 165 170 175 

TC&A CTA GCA GAC TAT TAT CAA CAA AAT ACT CCA ATT CTC GAT GGC CCT 576 
f*Jn Leu Ala Asp Tyr Tyr Gin Gin Asn Thr Pro He Leu Asp Gly Pro 
«Ta 180 185 190 

CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG 624 



jS*C 

v*l 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA 672 
s Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
B 210 215 220 

ISA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 714 
gfcr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
230 235 

Q 

SFA 716 

&= s 

H) INFORMATION FOR SEQ ID NO:2: 

ro 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Ala Val Val Pro He Leu Val 
IS 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Val Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
65 70 7S 80 

His Asp« Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Gin Arg 
85 90 95 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 12S 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 
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Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
14S 150 155 160 

lie Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

Gin Leu Ala Asp Tyr Tyr Gin Gin Asn Thr Pro lie Leu Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 21S 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Ly3 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 



£3 (D) TOPOLOGY: linear 

SI 

h= 

(ix) FEATURE: 
^ (A) NAME/KEY: CDS 

Si (B) LOCATION: 1..720 

fH 

= (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
III 240 245 250 

©EC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
VMl Glu Leu Aso Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
|lS * 260 265 270 

(Ms GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
275 280 285 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
290 295 300 

TTC GGC TAC GGC GTG CAG TGC TTC GCC CGC TAC CCC GAC CAC ATG AAG 240 
Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyx Pro Asp His Met Lys 
305 310 315 

CAG CAG GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin Gin Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
320 325 330 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
335 340 345 350 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr I^u Val Asn Arg He Glu Leu Lys Gly 
355 ' 360 365 

ATC GAC TTC AAG GAC GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Asp Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
\> 370 375 380 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
385 390 395 

v 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
400 405 410 



V 
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GTG CAG CCC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Pro Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
41S 420 425 430 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC TAC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
435 440 445 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
450 455 460 

GTG ACC GCC GCC GGG ATC ACT CAC GGC ATG GAC GAG CTG TAC AAG TAA 720 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys * 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 
jssis (B) TYPE: amino acid 

«4 (D) TOPOLOGY: linear 

-f'S 5 

£3 (ii) MOLECULE TYPE: protein 

**i (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Li- 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
WJl 5 10 IS 

-Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

[p'S 20 25 30 

Isiu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
Ll 3S 40 45 

f b's 

§ CVs Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
O 50 55 60 

4>he Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

£-j55 70 75 80 

f4kn Gin Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Asp Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Pro Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200* 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
2^0 215 220 

V 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys * 
225 230 235 240 



v 



